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ABSTRACT The maturation of SARS coronavirus involves the autocleavage of polyproteins 1a and 1ab by the main protease 
(Mpro) and a papain-like protease; these represent attractive targets for the development of anti-SARS drugs. The functional unit 
of Mpro is a homodimer, and each subunit has a His-41---Cys-145 catalytic dyad. Current thinking in this area is that Mpro dimer- 
ization is essential for catalysis, although the influence of the substrate binding on the dimer formation has never been explored. 
Here, we delineate the contributions of the peptide substrate to Mpro dimerization. Enzyme kinetic assays indicate that the mono- 
meric mutant R298A/L exhibits lower activity but in a cooperative manner. Analytical ultracentrifugation analyses indicate that in 
the presence of substrates, the major species of R298A/L shows a significant size shift toward the dimeric form and the mono- 
mer-dimer dissociation constant of R298A/L decreases by 12- to 17-fold, approaching that for wild-type. Furthermore, this 
substrate-induced dimerization was found to be reversible after substrates were removed. Based on the crystal structures, 
a key residue, Glu-166, which is responsible for recognizing the Gin-P1 of the substrate and binding to Ser-1 of another protomer, 
will interact with Asn-142 and block the S1 subsite entrance in the monomer. Our studies indicate that mutation of Glu-166 in the 
R298A mutant indeed blocks the substrate-induced dimerization. This demonstrates that Glu-166 plays a pivotal role in connect- 
ing the substrate binding site with the dimer interface. We conclude that protein-ligand and protein-protein interactions are closely 


correlated in Mpro. 


INTRODUCTION 


Severe acute respiratory syndrome (SARS) is an emerging 
infectious disease of this century and is caused by a novel 
coronavirus (CoV) termed SARS-CoV (1). During the 
outbreak in 2003, this virus infected >8000 people and the 
fatality rate in humans was as high as 15% (World Health 
Organization). In 2004 and 2005, the discovery of two 
more species of CoV that infect humans, NL-63 and 
HCoV-HKU1, confirm the high mutating rate and genetic 
recombination within Coronaviridae (2,3). The fact that the 
virus is easily transmitted between humans makes the ree- 
mergence of SARS and other human CoVs a distinct possi- 
bility and has resulted in an urgent need to understand these 
viruses and their coding proteins. 

SARS-CoV main protease (Mpro) cleaves the virion poly- 
proteins (ppla and 1b) at 11 sites that contain the canonical 
L-Q- | -(A/S) sequence (4,5). Mpro was the first of the 
SARS-CoV proteins to have its three-dimensional structure 
solved by crystallography (6). Recently, structures of other 
CoV Mpros such as TGEV, IBV, and HCoV-HKUI1 have 
also been solved (7-9). Although the overall sequence iden- 
tity of these Mpros is only 40-50%, the three-dimensional 
structures, especially the regions of dimer interface, catalytic 
dyad, and substrate binding site, are highly conserved (10). 
Therefore, the design of broad-spectrum inhibitors of Mpro 
appears to be feasible for drug development (11). 
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Mpro is a homodimer in which the two subunits are ar- 
ranged perpendicular to each other. Each protomer 
comprises three distinct structural domains. The first two 
domains (residues 8-101 for domain I and residues 102— 
184 for domain II) have an antiparallel G6-barrel structure, 
which forms a folding scaffold similar to other viral chymo- 
trypsinlike proteases (7,12,13). Each subunit has its own 
substrate binding site consisting of a His-41- --Cys-145 cata- 
lytic dyad located at the interface between domains I and II 
(Fig. 1). An oxyanion hole is formed by the main-chain 
amides of Gly-143, Ser-144, and Cys-145 (6). Interestingly, 
Mpro contains an extra domain (III), which consists of five 
a-helices (residues 201-306), and this is a specific feature 
of CoV Mpro (6-9). 

The catalytic N-terminal domain and C-terminal domain 
If can fold independently. The N-terminal domain is 
a monomer that is able to fold correctly but is catalytically 
inactive (14). The extra domain III increases the structural 
stability of the catalytic domain by increasing the folding 
rate. Furthermore, domain III is involved in the dimerization 
of Mpro, which has important functional implications for 
this enzyme (15). The side chain of Arg-4 at the N-finger 
(residues 1—7) of protomer A fits into a pocket of protomer 
B and forms a salt bridge with Glu-290; this constitutes 
one of the major interactions between the two subunits 
(16). Our previous studies have shown that N-terminal trun- 
cation of the whole N-finger results in almost complete loss 
of enzymatic activity (17). Critical N-terminal amino acid 
residues up to Arg-4 and C-terminal residues up to Gln- 
299 have been identified as involved in dimerization and 
thus in generating the correct conformation of the active 
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FIGURE 1 Active center of the SARS-CoV Mpro. The interactions 
between the P1 substrate-binding subsite from chain A (cyan) and the 
N-finger and domain HI from chain B (magenta) of SARS-CoV Mpro 
(PDB code 1UK4) are shown. The substrate analog and the side chain of 
the Gln-P1 residue are yellow. The R298A monomeric structure (green) 
(PDB code 2QCY) is superimposed on the wild-type chain A. The dashed 
lines indicate hydrogen bonds for 1UK4 (red) and 2QCY (black). This figure 
was produced using PyMOL (35). 


site (17,18). In addition, the interactions between the two 
helices A’ (residues 11-15) and the S1 substrate-binding 
subsite, consisting of Phe-140, His-163, Met-165, Glu-166, 
and His-172, are also regarded as major components of the 
dimer interface (19,20). 

Compared with functional dimeric Mpro, the crystal struc- 
tures of monomeric Mpro (G11A, S139A, and R298A) have 
provided direct structural evidence for the catalytic incompe- 
tence of the dissociated monomer (19-21). In the monomer 
mutants, the oxyanion loop (Ser-139 to Leu-141) is con- 
verted into a short 3,9-helix and completely collapses 
inward, as exemplified by the large movement of Asn-142 
and Leu-141. The slipped Asn-142 interacts with the side 
chain of Glu-166 and blocks entry to the S1 subsite; this 
results in enzyme inactivation (Fig. 1) (20). All experimental 
results have indicated that the dimerization of Mpro is essen- 
tial for catalysis. Here, we provide what we believe is a novel 
approach that shows the influence of substrate binding on 
Mpro dimerization. Various biochemical and biophysical 
techniques are used to demonstrate the importance of 
substrate-induced dimerization to the catalytic mechanism 
of Mpro. By mutagenesis studies, a key residue, Glu-166, 
is found to play a connecting role between the substrate 
binding site and the dimer interface. We believe that this 
study will help deepen our understanding of the correlation 
between protein-ligand and protein-protein interactions in 
Mpro. Such an understanding will aid the development of 
new approaches to control SARS-CoV and other CoVs. 
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MATERIALS AND METHODS 
Preparation of recombinant SARS-CoV Mpro 


The construction of the expression plasmids for wild-type Mpro, R298A, 
R298L, and R298A/Q299A mutants have been described previously 
(16,18). For the site-directed mutagenesis (22) of E166A, the forward primer 
was 5/-tgctatatgcatcatatggcgcttccaacaggagtacac and the reverse primer was 
5'-gtgtactcctettgeaagceccatatgatgcatatag. The polymerase chain reaction 
products were then treated with DpnI and transformed into Escherichia 
coli cells and checked by autosequencing. Next, the plasmids were trans- 
formed into FE. coli strain BL21 (DE3) cells. The cells were grown in 
Luria-Bertani medium with 50 ug/ml Kanamycin at 37°C. After 3 h incuba- 
tion, the cells were induced overnight by 0.4 mM isopropyl-1-thio-6-D- 
galactoside at 20°C. For purification, all operations were performed at 
4°C. The cells were centrifuged at 6000 x g for 10 min. The supernatant 
was removed and the pellets were resuspended in the lysis buffer (20 mM 
Tris-Cl, 250 mM NaCl, 2 mM 6-mercaptoethanol (G-ME), and 0.2% Triton 
X-100, pH 8.5). The cells were then sonicated using sixty 10-s bursts at 
300 W with a 10-s cooling period between each burst. The cell debris was 
removed by centrifugation (10,000 x g for 25 min). Lysis buffer-equili- 
brated Ni-NTA slurry (Qiagen, Hilden, Germany) (1 ml) was then added 
to the soluble lysate, followed by gentle mixing for 1 h. The mixture was 
then loaded into a column and washed with the washing buffer (20 mM 
Tris-Cl, 250 mM NaCl, 8 mM imidazole, and 2 mM 6-ME, pH 8.5). Finally, 
the protease was eluted with elution buffer (20 mM Tris-Cl, 30 mM NaCl, 
150 mM imidazole, and 2 mM 6-ME, pH 8.5). 

The purified enzyme was concentrated and the buffer replaced using phos- 
phate-buffered saline (20 mM sodium phosphate buffer, 150 mM NaCl, and 
2 mM 6-ME, pH 7.6) and an Amicon Ultra-4 centrifugal filter with a mass 
cutoff at 10 kDa (Millipore, Bedford, MA). Samples from this purification 
step were subjected to SDS-PAGE to check for homogeneity (>95% purity). 
Typical yields of the enzyme after purification were 5—10 mg from 1 liter of 
E. coli culture medium. 


Enzyme kinetic assay 


The enzymatic activity of Mpro was measured by a colorimetry-based 
peptide cleavage assay involving the 6-mer peptide substrate TSAVLQ- 
para-nitroanilide (TQ6-pNA) (purity 95-99% by high-performance liquid 
chromatography; GL Biochem, Shanghai, China) (23). This substrate is 
cleaved at the GIn-pNA bond to release free pNA, which turns the solution 
color to yellow. The increase in absorbance at 405 nm was continuously 
monitored using a Jasco V-550 UV/VIS spectrophotometer (Tokyo, Japan). 
The amount of pNA released from the proteolysis can be calculated with 
a standard curve generated by analytical-grade pNA, which is consistent 
with the absorbance reported in the literature (A4o5 nm = 9.8 at 1 mM) (24). 

The protease activity assay was performed in 10 mM phosphate (pH 7.6) 
at 30°C. The substrate stock solution was 1 mM and the working concentra- 
tions were 5—950 uM. In the substrate titration assay, the concentrations of 
Mpro, E166A, and R298A/L mutants were 1, 5.7, and 2.6 uM, respectively. 
The steady-state enzyme kinetic parameters were obtained by fitting the 
initial velocity (vo) data to the Michaelis-Menten equation (Eq. 1): 


_ Keat[E] [S| 
” = Ke + " 


where ka; 1S the catalytic constant, [E] is the enzyme concentration, [S] is the 
substrate concentration, and K,, is the Michaelis constant of the substrate. 
The program SigmaPlot (Systat Software, Richmond, CA) was used for 
the data analysis. 

For the cooperativity effect, the kinetic parameters were obtained by 
fitting the initial velocities to the Hill equation (Eq. 2): 
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where K’ is a constant that relates to the dissociation constant and h is the 
Hill constant. 

The dependence of the proteolytic activity on enzyme concentration was 
investigated for the wild-type Mpro and R298A/L mutants, at a substrate 
concentration of 600 wM TQ6-pNA. The initial velocity of the reaction at 
various concentrations of each enzyme was determined and fitted to the 
nonlinear dependence equation (Eq. 3): 


Vo = Kat (K: ar A|E] = \/ Kj str 8KifE)) /8, (3) 


where Kg, is the monomer-dimer dissociation constant. 


Analytical ultracentrifugation analysis 


The analytical ultracentrifugation (AUC) experiments were performed on an 
XL-A analytical ultracentrifuge (Beckman, Fullerton, CA) with an An-50 Ti 
rotor (17). The sedimentation velocity (SV) experiments were performed in 
a double-sector epon charcoal-filled centerpiece at 20°C with a rotor speed 
of 42,000 rpm. The sample (330 wl) and reference (370 wl) solutions with or 
without different concentrations of TQ6-pNA substrate were loaded into the 
centerpiece. We found that the TQ6-pNA was cleaved and free pNA was 
released in the process of centrifugation (detected by absorbance at 
405 nm). The absorbance spectrum of free pNA interfered with protein 
absorbance at 280 nm. Therefore, absorbance at 250 nm was chosen to 
detect the protein, which was monitored in a continuous mode with a time 
interval of 480 s and a step size of 0.003 cm. Three different protein concen- 
trations (from 1.4 to 57.2 uM) were used to estimate the dynamic monomer- 
dimer content. Multiple scans at different time intervals were then fitted to a 
continuous c(s) distribution model using the SEDFIT program (25,26). The 
partial specific volume of Mpro, the solvent density, and the viscosity were 
calculated by SEDNTERP (http://www. jphilo.mailway.com/download.htm) 
(cited Oct. 20, 2009). 

The sedimentation equilibrium (SE) experiments were performed in a six- 
channel centerpiece. Three different samples (0.10—0.12 ml) were loaded 
into the sample channels and 0.11—13 ml buffers were loaded into the refer- 
ence channels. The cells were then loaded into the rotor and run at multi- 
speeding (8000, 12,000, and 15,000 rpm), each for 12 h at 20°C. Ten scans 
of absorbance at 250 nm at time intervals of 10 min were measured for every 
rotor speed to check the status of SE. In our studies, all Mpro and its mutants 
were able to achieve equilibrium state after 12 h. The SV results at three 
protein concentrations and the multispeed SE data were then globally 
analyzed using a monomer-dimer equilibrium model by the SEDPHAT 
program (27), which gives a precise measurement for Kg and the dissociation 
rate constant (Kor) (18,28,29). 


Analytical size-exclusive chromatography 


Size-exclusive chromatographic experiments were performed using a GE 
Healthcare AKTA purifier system (Pittsburgh, PA) with a Superose 12 
(10/300) column preequilibrated with phosphate-buffered saline (pH 7.6). 
Mpro and its mutants without or with 600 uM of substrate preincubation 
for 20 min were injected into the column separately. The elution was carried 
out at a flow rate of 0.5 ml/min and the absorbance at 280 nm was monitored 
continuously. 


Isothermal titration calorimetry 


The isothermal titration calorimetry (ITC) protocol followed was that of 
Sondermann et al. (30) with some modifications. Apparent dissociation 
constants and stoichiometry of the enzyme-ligand interactions were 
measured by a thermal activity monitor (2277, TA instruments, New Castle, 
DE). Calorimetric titrations of the peptide substrate TQ6-pNA (1 mM in 
a 250-ul syringe) and Mpro (5.7 uM for wild-type and 28.6 wM for mutants 
in a 4-ml ampoule) were carried out at 25°C in 10 mM phosphate buffer 
(pH 7.6). The peptides were titrated into the enzyme in 10-1 aliquots per 
injection with a time interval of 20 min. A control experiment in the absence 
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of enzyme was performed in parallel to correct for the dilution of heat. The 
data were then analyzed by integrating the heat effects normalized to the 
amount of injected proteins using curve fitting based on a 1:1 binding model. 
This involved the use of Digitam software (TA instruments). 


RESULTS AND DISCUSSION 


Cooperative effect of initial velocity curves 
of R298A and R298L mutants 


To measure the activity of Mpro and its mutants, we used the 
6-mer substrate peptide (TSAVLQ) attached to a pNA group 
(23). This peptide is specifically cleaved by SARS-CoV 
Mpro at the designated site (Gln-pNA) to release free 
pNA, which results in an increased absorbance at 405 nm. 
Besides wild-type Mpro as a dimeric target, two single 
mutants, R298A and R298L, and one double mutant, 
R298A/Q299A, were chosen as the monomeric targets. Ac- 
cording to previous studies (19), mutation of R298 or Q299 
will induce dimer dissociation and result in an ~10-fold 
decrease in proteolytic activity (18). When R298 and Q299 
are both mutated, the enzyme activity decreases by 100-fold. 
Other studies also suggest that the monomeric Mpro by other 
mutations shows very low or no _ proteolytic activity 
(16—21,31). Indeed, in this study, we were not able to detect 
any enzyme activity associated with the R298A/Q299A 
double mutant. However, for the single mutants, R298A or 
R298L, the initial velocity pattern at various substrate 
concentrations displayed a sigmoid curve (Fig. 2, B and C). 
The dimeric Mpro, on the other hand, exhibited a classical 
saturation curve (Fig. 2 A). These results were then fitted 
to the Michaelis-Menten or Hill equations to evaluate the 
kinetic parameters. The best-fit results are shown in Table 1. 
The K,, (223 uM) and kat (0.63 s~') of wild-type Mpro with 
TQ6-pNA substrate are close to observations from other 
laboratories (23,32). After fitting to the Hill equation, the 
Keat Of R298A was sixfold lower than that of the wild-type 
enzyme. In contrast, R298L showed a k.., close to that of 
the wild-type enzyme. The Hill constants of R298A and 
R298L were 2.0 and 1.8, respectively. This nonunity number 
suggested that there is a strong positive cooperativity among 
the Mpro protomers. However, since the cooperativity 
phenomenon is not compatible with a monomeric form, we 
sought other evidence to examine the possibility of dimeric 
Mpro formation during the catalytic process. 


Nonlinear dependence of initial velocity 
on protease concentration 


The dependence of the initial velocity on protease concentra- 
tion was analyzed (Fig. 2, D—F). If the monomeric Mpro has 
an activity identical to that of the dimeric one, a linear pattern 
should be obtained (28). However, a nonlinear positive 
correlation was observed for the wild-type and R298A/L 
monomeric mutants. After fitting to the nonlinear depen- 
dence equation (Eq. 3), the ka (Table 1) and Kg for the 
monomer-dimer equilibrium (Table 2) were calculated. 
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FIGURE 2 Initial velocity patterns of Mpro (A—C) and 
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The k., for wild-type and the mutants did not show any 
significant difference, whereas the Kg values for R298A/L 
were increased from 14- to 17-fold, indicating that the 
R298A/L mutants are more easily dissociated. The nonlinear 
dependence detected here suggests that proteolytic activity is 
actually related to the dimer content, which in turn is influ- 
enced by the protein concentration. Based on this, we next 


TABLE 1. Kinetic parameters of Mpro and its mutants 
Keat (8 ') by 
Keat! Km nonlinear 
Protein Kym (uM)* ke (s-' M7!) fitting! 
Wild-type 222.6 + 19.0 0.63 + 0.02 2830 + 303 14 + 0.2 
E166A 353.4 + 48.0 0.31 + 0.01 877 + 132 ND? 
K' (10° uM) eat (S") h 
R298A 49 223 0.10 + 0.004 2.0 + 0.1 0.7 + 0.4 
R298L 95 + 6.2 0.48 + 0.06 1.8 + 0.1 23 = OY 


*Data of wild-type Mpro and E166A mutant were fitted to the Michaelis- 
Menten equation (Eq. 1) and the R,,, values were 0.994 and 0.987, respec- 
tively. Those of mutants were fitted to the Hill equation (Eq. 2), and the Rygr 
values were 0.998 and 0.996, respectively. All the assays were repeated 
several times to ensure reproducibility. 

‘Catalytic constant was calculated by the nonlinear dependence equation 
(Eq. 3). The best-fitting R,, values for wild-type, R298A, and R298L 
were 0.995, 0.99 and 0.993, respectively. 

‘Not determined. 
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dependence of the initial velocity of Mpro on enzyme 
concentration (D—F’). (A—C) Plots of the velocity difference 
at various substrate concentrations for wild-type, R298A, 
and R298L mutants, respectively. The lines represent 
results fitted according to the Michaelis-Menten equation 
(Eq. 1) for wild-type and the Hill equation (Eq. 2) for 
mutants. The kinetic parameters are shown in Table 1. 
(D-F) Difference in velocity at various enzyme concentra- 
tions for wild-type, R298A, and R298L mutants, respec- 
tively. The concentration of substrate was 600 uM. The 
line represents the best fit to the nonlinear dependence 
equation (Eq. 3). 
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performed AUC experiments on Mpro with or without the 
presence of peptide substrates to evaluate the quaternary 
structural changes during the catalytic process. 


Dimeric R298A and R298L mutants in the 
presence of peptide substrates can be 
detected by AUC 


Interestingly, wild-type Mpro and the R298 mutants dis- 
played different monomer-dimer equilibrium with or without 
substrates. A typical SV experiment of AUC is shown in 
Fig. 3 A. The cumulative spectra were analyzed using the 
continuous size distribution model (26), which shows the 
quaternary structure distribution and the sedimentation coef- 
ficients (S) (Fig. 3, B—E). Although wild-type Mpro main- 
tained a stable dimeric form, the R298A/Q299A double 
mutant existed exclusively in monomeric form irrespective 
of the presence of high concentrations of substrate (Fig. 3, 
B and E£). Here, we confirmed that the monomeric and 
dimeric Mpro sedimented at 2.9 and 4.1 S, respectively. 
These values are consistent with previous observations 
(16,18,21). 

However, a different story was found for the R298 single 
mutants. In the presence of substrates, the R298A and R298L 
mutants showed a significant shifting of the major species 
(Fig. 3, C and D). At 600 uM TQ6-pNA, the major species 


Substrate-Induced Dimerization of Mpro 


TABLE 2 Dissociation of Mpro and its mutants with and without substrates 


Nonlinear fitting 


No substrate 


Protein Ka (uM)* Ka (uM)! 
Wild-type 0.8 + 0.4 2.0 + 0.01 
R298A 11.4 +97 81.1 + 3.3 
R298L 13.8 + 8.7 30.8 + 0.4 
R298A/Q299A ND° 85.7 + 3.1 
E166A ND 37+ 02 
E166A/R298A ND 141.1 + 12.7 


*Kq was calculated by nonlinear dependence equation (Eq. 3). 
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AUC analysis 
With 600 uM substrate 
Kore (8 ')! Ka (uM)! Kote (8 ')! 

0.1 + 0.007 1.7 + 0.03 0.1 + 0.002 
0.1 + 0.004 4.7+ 0.1 0.08 + 0.001 
0.1 + 0.001 2.6 + 0.1 0.1 + 0.003 
0.1 + 0.004 115.0 + 4.4 0.05 + 0.002 
0.04 + 0.001 0.4 + 0.01 0.1 + 0.002 
0.1 + 0.009 103.7 + 8.3 0.06 + 0.005 


‘Parameters were derived from a global fit of the SV and SE data to a monomer-dimer self-association model by SEDPHAT. The SE experiments for the assay 
were obtained at protein concentrations of 5.7—28.6 uM, whereas SV experiments were at 1.4—57.2 uM. 


of R298A/L shifted from 2.9 S to 3.8—3.9 S. The distribution 
now became a broad peak between the monomer and dimer, 
but closer to the dimer. In a fast dissociated and associated 
system, a broad size distribution will appear between the 
monomer and dimer when the protein concentration is close 
to the Kg (27). Thus, SARS-CoV Mpro acts as a rapid mono- 
mer-dimer self-association equilibrium system. Furthermore, 
we also measured the size distribution of R298A at various 
substrate concentrations. The results clearly indicated that 
the distance of major species shifting was related to the 
substrate dosage (Fig. 4 A). More substrates resulted in 
more of the dimer form and the major species became closer 
and closer to the dimer. The trend was a linear positive corre- 
lation (Fig. 4 B). Therefore, we conclude that the presence of 
substrate can induce a quaternary structural change in the 
R298A/L mutants. The system described in this article 
prevents any further monomer-dimer equilibrium analysis 
of the broad size distribution among the monomer and dimer 
species. We therefore performed the next set of experiments 
to further delineate the monomer-dimer equilibrium. 


Monomer and dimer distribution of R298A/L 
mutants can be clearly delineated in deuterium 
water 


To clearly separate the monomeric and dimeric signals, the 
speed of the protein self-association needs to be slowed 
down. To do this, we performed the same SV experiments 
in deuterlum water (DO), which has a higher density 
(Fig. Sl A in the Supporting Material). All the buffer solu- 
tions and substrate stocks were prepared in D,O to give 
the same ionic strengths. In the D,O environment, the mono- 
meric and dimeric Mpro sedimented at 1.9 and 2.6 S, respec- 
tively (Fig. Sl, B and £). For the R298A mutant, there was a 
single monomeric form in the absence of substrate and a 
single dimeric form in the presence of substrate (Fig. S1 C). 
In addition, under the same conditions, both monomeric and 
dimeric forms of R298L could be observed (Fig. S1 D). This 
demonstrates that the high density of D,O is able to slow 
down the exchange rate within the monomer-dimer equilib- 
rium and clearly separate the signals from the two species. 


We next analyzed the size distribution of the R298A mutant 
at various substrate concentrations in D,O (Fig. S2 A). The 
results clearly indicate that the increase in dimeric form 
content is closely correlated with the substrate concentration 
(Fig. S2 B). 

In summary, the measurements obtained by the SV exper- 
iments proved that the R298A/L mutants are able to associate 
into a dimeric form during the catalytic process. This 
substrate-induced dimerization is dose-dependent. 


Dramatic decrease in dissociation constant 
in the presence of peptide substrate 


To quantitatively characterize the monomer-dimer equilib- 
rium of Mpro and its mutants, SV experiments were carried 
out at the three different enzyme concentrations with or 
without 600 uM TQ6-pNA. In addition, SE experiments 
were executed. The three SV and one SE datasets were then 
globally fitted to the monomer-dimer self-association model 
by SEDPHAT to estimate the Kg and Ko¢,. The best-fit results 
are shown in Table 2. The ko¢¢ values for wild-type Mpro and 
its three mutants were all close to 0.1 s~!, which indicates that 
the monomer-dimer exchange is instantaneous (27). The Kg 
values of wild-type Mpro without and with substrates were 
2.0 and 1.7 uM, respectively, when those for the R298A/ 
Q299A double mutant were 85 and 115 uM. Not surprisingly, 
the Kg of the R298A/L mutants was dramatically decreased, 
by 12- to 17-fold, in the presence of substrate, approaching 
the values for the wild-type (Table 2). The Kg rescued by 
substrates suggested that the R298A/L mutants display 
a protein association comparable to that of the wild-type in 
the presence of substrates. On the other hand, by nonlinear 
dependence fitting, the Kg of wild-type in the presence of 
substrates is 2.5-fold lower than in the absence of substrates 
(Table 2). The influence of substrates on the dimerization of 
wild-type is minor because of the stronger association 
between its protomers. In contrast, the R298A/Q299A double 
mutant lost two important ion pairs (Ser-123 (A)-- -Arg-298 
(B) and Ser-139 (A)---Gln-299 (B)) in the dimer interface 
(18) and results in complete dissociation, even in the presence 
of substrates. 
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FIGURE 3 SV patterns of Mpro. (A) Typical trace of absorbance at 
250 nm of the enzyme during the SV experiment. Symbols represent exper- 
imental data and lines the results fitted to the Lamm equation using the 
SEDFIT program (25,26). (B—E) Continuous (c(s)) distribution of wild-type, 
R298A, R298L, and R298A/Q299A mutants, respectively. The residual 
bitmaps are shown in the insets. Protein concentration was 1.0 mg/ml. 
The distributions in 10 mM phosphate buffer are shown by solid lines and 
those in 10 mM phosphate containing 600 uM TQ6-pNA substrate by 
dashed lines. The left vertical dotted line indicates the monomer, and the 
right the dimer position. 
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FIGURE 4_ Effects of substrate concentration on the quaternary structure 
of the R298A mutant. (A) Continuous (c(s)) distribution of the R298A 
mutant at TQ6-pNA concentrations of O (solid circles), 20 (open circles), 
120 (solid triangles), 250 (open triangles), 400 (solid squares), and 
600 uM (open squares). (B) Sedimentation coefficient shift of R298A major 
species at different TQ6-pNA concentrations. 


Dimerization induced by peptide substrate 
is reversible 


To further evaluate whether the substrate-induced dimeriza- 
tion is a dynamic equilibrium or a permanent, irreversible 
process, we performed size exclusion chromatography on 
Mpro and its mutants. The protease was preincubated with 
or without 600 uM TQ6-pNA for 20 min and then injected 
into the Superose 12 column (Fig. 5). The running buffer 
was substrate-free. The results showed that after removing 
the substrate, wild-type Mpro still maintained a dimeric 
form, but the three mutants were all monomeric. This 
suggests that substrate-induced dimerization is a reversible 
process. When the substrate leaves, the R298A/L dimers 
dissociate back to being a monomer. A similar result was 
obtained when the substrates were removed by the HiTrap 
desalting column (GE Healthcare, Pittsburgh, PA) and then 
analyzed by AUC (Fig. S3). 
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FIGURE 5_ Size-exclusion chromatography of Mpro. The enzyme in 
phosphate-buffered saline buffer was applied to a preequilibrated Superose 
12 10/300 GL column. The Mpro and its mutants, without or with preincu- 
bation in 600 uM substrate, are wild-type (solid circles), wild-type with 
substrate (open circles), R298A (solid triangles), R298A with substrate 
(open triangles), R298A/Q299A (solid squares), and R298A/Q299A with 
substrate (open squares). 


ITC assay showed that both dimeric and 
monomeric Mpro can bind to the peptide 
substrates 


Based on the structure of the SARS-CoV Mpro with its 
substrate analog (PDB code 1UK4), the substrate can bind 
by different modes to the active and inactive protomer (6). 
With the active protomer, the side chain of Gln-P1 accepts 
hydrogen bonding from the side chain of His-163 and Glu- 
166, resulting in specificity of Gln-P1 for the S1 subsite 
(Fig. 1). On the other hand, with the inactive protomer, the 
S1 pocket is not opened and thus does not allow the Gln- 
Pl to enter. Instead, Leu-P2 and Ser-P4 bind to the appro- 
priate specific pocket. These differences imply that the 
substrate binding site is flexible and can undergo a significant 
conformational change. 

To further delineate the binding of substrate to the Mpro 
and its mutants, we employed ITC to measure the dissocia- 
tion constant (Kg for the substrate-enzyme complex), the 
binding enthalpy change (AH), and the binding stoichiom- 
etry (NV) of the substrate-enzyme interaction. In the ITC 
experiments, 1 mM TQ6-pNA was injected into the protein 
solution and curve fitting based on a 1:1 binding model 
was carried out (Fig. 6). Compared with wild-type Mpro 
(Fig. 6 A), all three mutants (Fig. 6, B—D) exhibited a similar 
N and Kg. The N values are close to 1, which means that one 
substrate may bind one enzyme molecule. The Kg of 
substrate to wild-type Mpro was 34 uM, whereas those of 
substrate to R298A, R298L, and R298A/Q299A mutants 
were 81, 114, and 30 uM, respectively. However, the AH 
values for substrate binding to Mpro mutants were decreased 
by three- to sixfold compared with substrate binding to wild- 
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type. Although this might be an artifact, because the enzy- 
matic hydrolysis during the titration might produce addi- 
tional heat that will contribute to the overall AH, the AH, 
N, and Kg values from our ITC observations still suggest 
that the wild-type Mpro and its mutants, especially the 
monomeric R298A/Q299A mutant, have comparable sub- 
strate-binding affinities. 


Mechanism for substrate-induced dimerization 
of SARS-CoV Mpro 


On the basis of the solved monomeric structures (PDB codes 
2QCY, 2PWX, and 3F9E), the loop of Ser-139 to Leu-141 is 
able to form a 3,9-helical structure, which results in an Asn- 
142 shift by 5.8 A (20). This shift helps the side-chain amide 
of Asn-142 to accept hydrogen bonding from the side chain 
of Glu-166 (Fig. 1). In the dimeric Mpro (PDB code 1UK3), 
the side chain of Glu-166 interacts with the main-chain 
amide of Ser-1 from protomer B (6). In the substrate-bound 
form (PDB code 1UK4), in protomer A, Gln-P1 enters the S1 
binding pocket and its side-chain amide and carbony]! donate 
a hydrogen bond to the side chains of Glu-166 and His-163, 
respectively. Comparing these structures, the Glu-166 
residue should play a distinct role. In the dimer, Glu-166 
binds to the GlIn-P1 of the substrate and the N-finger of pro- 
tomer B. In the monomer, it shifts to interact with Asn-142, 
which is able to stabilize the 3, 9-helix of Ser-139 to Leu-141. 

To obtain direct evidence for the role of Glu-166, we 
mutated and purified El166A and E166A/R298A mutants 
and then performed the kinetic and quaternary structure anal- 
yses. The E166A single mutant had an ~3-fold loss in 
enzyme activity based on Kk a/K,, (Table 1), whereas the 
enzyme activity of the E166A/R298A double mutant was 
not detectable. Furthermore, the experimental data from 
ITC for the E166A point mutant and the E166A/R298A 
double mutant suggest that these two mutants have compa- 
rable substrate binding affinities (Fig. S4). This indicates 
that without Glu-166, the substrate can still bind to the active 
site. On the other hand, the SV experiments suggest that 
E166A is a dimer with or without substrates (Fig. 7 A). 
Furthermore, the Kg of E166A was 3.7 uM and decreased 
to 0.4 uM in the presence of substrates, which was very close 
to the Kg for wild-type (Table 2). It is reasonable in that Glu- 
166 (A)-:-Ser-1 (B) is not the main force stabilizing the 
dimer interface. However, when Glu-166 and Arg-298 
were both mutated, the dimerization was not detected, and 
the Kg did not decrease in the presence of substrates 
(Fig. 7 B and Table 2). The results suggest that mutation 
of Glu-166 in R298A mutant will block the substrate- 
induced dimerization. This demonstrates that Glu-166 plays 
a pivotal role in connecting the substrate binding site with the 
dimer interface. The highly conserved Glu-166 in most 
CoVs, including SARS CoV, MHV, TGEV, HCoV, and 
IBV (8), implies that involvement of this residue in the pro- 
tomeric communication may be universal in all virion Mpro. 
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To summarize the above observations, a schematic 
diagram showing the proposed catalytic process of SARS- 
CoV Mpro is presented in Fig. 8. The wild-type is a dimer, 
as shown in our AUC studies, but we are not very sure about 
the quaternary structure of Mpro under the highly diluted 
assay conditions (<1.4 uM). We assume that there is a rapid 
equilibrium between the monomer and dimer in solution. 
Mutation at Arg-298 will push the equilibrium to monomer. 
There is also a rapid equilibrium between the active and inac- 
tive protomers. Substrate binding to the protomer (either 
a monomer or a dimer) induces a conformational change 
of the enzyme that shifts the equilibrium from monomers 
to the catalytic-competent dimers. The protein-concentra- 
tion-dependent enzyme activity experiment indicated that 
only the dimer is enzymatically active. Arg-298 and Gln- 
299 are involved in this rate-limiting step for Mpro catalysis. 
When both residues are mutated, there will be no substrate- 
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FIGURE 6 Isothermal calorimetric titration for the 
substrate TQ6-pNA binding to Mpro and its mutants. 
(A—D) Binding of TQ6-pNA to wild-type, R298A, 
R298L, and R298A/Q299A mutants, respectively. Wild- 
type protein concentration was 5.7 uM and protein concen- 
tration of the mutants was 28.6 wM. The TQ6-pNA (1 mM) 
was titrated into 2.7 ml of protein solution using 25—30 
injections at a rate of 10 ul/injection. Values for the N, 
Kg, and AH were determined by ligand binding analysis 
with the Digitam program (TA instruments). The solid 
circles show the observed values and the lines represent 
fitted results. 


N = 0.97 + 0.01 
Kg = 29.9 + 2.1 uM 
AH = -191 + 2 kcal/mol 


induced dimerization. On the other hand, Glu-166 ts critical 
in the following subunit association step, which pushes the 
equilibrium to the catalytic-competent dimer formation. 
Then catalysis and release of products, followed by dissoci- 
ation and regeneration of the inactive protomer, complete the 
catalytic cycle. This mechanism is also compatible with 
many current and previous studies (18,21,33,34). 


CONCLUSION 


In this study, we have delineated the contributions of 
substrate binding to Mpro dimerization. Enzyme kinetic 
assays indicated a nonlinear upward dependence between 
the enzyme activity and enzyme concentration for wild- 
type and R298A/L mutants. The quaternary structure of 
R298A/L was also changed at different enzyme concentra- 
tions. The AUC analysis proved that in the presence of 
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FIGURE 7 Sedimentation velocity experimental values for Glu-166 
mutants E166A (A) and E166A/R298A (B) in D2O. The protein concentra- 
tion was 0.33 mg/ml. The distributions in DO are represented by solid lines 
and those in D2O with 600 uM TQ6-pNA substrate by dashed lines. The left 
vertical dotted line indicates the monomer position and the right, the dimer 
position. (/nsets) Residual bitmaps for mutants without (upper) and with 
(lower) substrate. 


peptide substrates, the major species of the R298A/L mutants 
showed significant size shifting toward the dimeric form, 
which became clear in the D,O environment. The substrate- 
induced dimerization of R298A/L was reversible upon 
removal of the substrates. Mutation of Glu-166 in R298A 
mutant significantly blocks the substrate-induced dimeriza- 
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FIGURE 8 Schematic model for the catalytic process of SARS-CoV 
Mpro. Substrate binding triggers the monomer-dimer equilibrium to favor 
the active protomer (square). The active protomer now is prompt to asso- 
ciate, forming the catalytic-competent dimer. After the catalytic cycle, the 
associated dimer will dissociate again to form the inactive protomer (circle). 
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tion. The connection between the substrate binding site and 
the dimer interface by Glu-166 may be universal in all CoV 
Mpro. Substrate binding induces and stabilizes enzyme 
dimerization, which then activates the enzyme molecule. 
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